How SVMs can estimate quantiles and the median
نویسندگان
چکیده
We investigate quantile regression based on the pinball loss and the ǫ-insensitive loss. For the pinball loss a condition on the data-generating distribution P is given that ensures that the conditional quantiles are approximated with respect to ‖ · ‖1. This result is then used to derive an oracle inequality for an SVM based on the pinball loss. Moreover, we show that SVMs based on the ǫ-insensitive loss estimate the conditional median only under certain conditions on P .
منابع مشابه
Early prediction of median survival among a large AIDS surveillance cohort
BACKGROUND For individuals with AIDS, data exist relatively soon after diagnosis to allow estimation of "early" survival quantiles (e.g., the 0.10, 0.15, 0.20 and 0.30 quantiles, etc.). Many years of additional observation must elapse before median survival, a summary measure of survival, can be estimated accurately. In this study, a new approach to predict AIDS median survival is presented and...
متن کاملThe use of Quantiles of Auxiliary Variables to Estimate Medians
The problem of estimating a population mean in the presence of an auxiliary variable has been widely discussed in the finite population sampling literature. For the problem of estimating a population median, however, the situation is quite different and only recently has this problem been discussed. Chambers and Dunstan (1986) proposed a method for estimating the population distribution functio...
متن کاملEXTREMAL QUANTILE REGRESSION 3 quantile regression
Quantile regression is an important tool for estimation of conditional quantiles of a response Y given a vector of covariates X. It can be used to measure the effect of covariates not only in the center of a distribution, but also in the upper and lower tails. This paper develops a theory of quantile regression in the tails. Specifically , it obtains the large sample properties of extremal (ext...
متن کاملNovel Algorithms for Computing Medians and Other Quantiles of Disk-Resident Data
In data warehousing applications, numerous OLAP queries involve the processing of holistic operations such as computing the "top N", median, etc. Efficient implementations of these operations are hard to come by. Several algorithms have been proposed in the literature that estimate various quantiles of disk-resident data. Two such recent algorithms are based on sampling. In this paper we presen...
متن کاملNoninformative nonparametric quantile estimation for simple random samples
For noninformative nonparametric estimation of finite population quantiles under simple random sampling, estimation based on the Polya posterior is similar to estimation based on the Bayesian approach developed by Ericson (1969, JRSSB, 31, 195-233) in that the Polya posterior distribution is the limit of Ericson’s posterior distributions as the weight placed on the prior distribution diminishes...
متن کامل